NSF PAR Search | NSF Public Access Repository

Operating the 200 Gbps IRIS-HEP Demonstrator for ATLAS

https://doi.org/10.1051/epjconf/202533701061

Gardner_Jr, Robert W; Benjamin, Douglas; Bryant, Lincoln; Feickert, Matthew; Golnaraghi, Farnaz; Held, Alexander; Hu, Fengping; Jordan, David; Stephen, Judith; Vukotic, Ilija; et al (October 2025, EPJ Web of Conferences)
Szumlak, T; Rachwał, B; Dziurda, A; Schulz, M; vom_Bruch, D; Ellis, K; Hageboeck, S (Ed.)
The ATLAS experiment is currently developing columnar analysis frameworks which leverage the Python data science ecosystem. We describe the construction and operation of the infrastructure necessary to support demonstrations of these frameworks, with a focus on those from IRIS-HEP. One such demonstrator aims to process the compact ATLAS data format PHYSLITE at rates exceeding 200 Gbps. Various access configurations and setups on different sites are explored, including direct access to a dCache storage system via XRootD, the use of ServiceX, and the use of multiple XCache servers equipped with NVMe storage devices. Integral to this study was the analysis of network traffic and bottlenecks, worker node scheduling and disk configurations, and the performance of an S3 object store. The system’s overall performance was measured as the number of processing cores scaled to over 2,000 and the volume of data accessed in an interactive session approached 200 TB. The presentation will delve into the operational details and findings related to the physical infrastructure that underpins these demonstrators.
Full Text Available
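To give a feel for the scale quoted in this abstract, the following back-of-the-envelope sketch divides the 200 Gbps aggregate target evenly over 2,000 cores. The even split, the decimal units (1 Gbps = 10^9 bit/s, 1 MB = 10^6 bytes), and the function name are assumptions made for illustration; the abstract does not state how load was distributed across cores.

```python
def per_core_rate_mb_per_s(aggregate_gbps: float, n_cores: int) -> float:
    """Average read rate each core must sustain, in MB/s.

    Assumes decimal units and a perfectly even split of the aggregate
    rate across cores (an idealisation, not a measured figure).
    """
    bits_per_second = aggregate_gbps * 1e9
    bytes_per_second = bits_per_second / 8
    return bytes_per_second / n_cores / 1e6


# Figures taken from the abstract: a 200 Gbps target and about 2,000 cores.
PER_CORE_MB_PER_S = per_core_rate_mb_per_s(200, 2000)
print(f"{PER_CORE_MB_PER_S:.1f} MB/s per core")
```

Under these assumptions each core would need to read on the order of ten megabytes per second, which is why storage, caching, and network layout feature in the study alongside core counts.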
The 200 Gbps Challenge: Imagining HL-LHC analysis facilities

https://doi.org/10.1051/epjconf/202533701217

Held, Alexander; Albin, Sam; Attebury, Garhan; Bloom, Kenneth; Bockelman, Brian; Bryant, Lincoln; Choi, Kyungeon; Cranmer, Kyle; Elmer, Peter; Feickert, Matthew; et al (October 2025, EPJ Web of Conferences)
Szumlak, T; Rachwał, B; Dziurda, A; Schulz, M; vom_Bruch, D; Ellis, K; Hageboeck, S (Ed.)
The IRIS-HEP software institute, as a contributor to the broader HEP Python ecosystem, is developing scalable analysis infrastructure and software tools to address the upcoming HL-LHC computing challenges with new approaches and paradigms, driven by our vision of what HL-LHC analysis will require. The institute uses a “Grand Challenge” format, constructing a series of increasingly large, complex, and realistic exercises to show the vision of HL-LHC analysis. Recently, the focus has been demonstrating the IRIS-HEP analysis infrastructure at scale and evaluating technology readiness for production. As a part of the Analysis Grand Challenge activities, the institute executed a “200 Gbps Challenge”, aiming to show sustained data rates into the event processing of multiple analysis pipelines. The challenge integrated teams internal and external to the institute, including operations and facilities, analysis software tools, innovative data delivery and management services, and scalable analysis infrastructure. The challenge showcases the prototypes — including software, services, and facilities — built to process around 200 TB of data in both the CMS NanoAOD and ATLAS PHYSLITE data formats with test pipelines. The teams were able to sustain the 200 Gbps target across multiple pipelines. The pipelines focusing on event rate were able to process at over 30 MHz. These target rates are demanding; the activity revealed considerations for future testing at this scale and changes necessary for physicists to work at this scale in the future. The 200 Gbps Challenge has established a baseline on today’s facilities, setting the stage for the next exercise at twice the scale.
Full Text Available
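The two headline numbers in this abstract, about 200 TB of data and a sustained 200 Gbps, can be related with simple arithmetic. The sketch below estimates how long one full pass over the data would take at that rate. Decimal units (1 TB = 10^12 bytes, 1 Gbps = 10^9 bit/s) and the function name are assumptions for illustration; the abstract does not report a run time.

```python
def full_pass_seconds(volume_tb: float, rate_gbps: float) -> float:
    """Time to stream `volume_tb` terabytes at `rate_gbps` gigabits per second.

    Assumes decimal units and a perfectly sustained rate.
    """
    total_bits = volume_tb * 1e12 * 8
    return total_bits / (rate_gbps * 1e9)


# Figures taken from the abstract: about 200 TB at 200 Gbps.
SECONDS = full_pass_seconds(200, 200)
print(f"{SECONDS:.0f} s, or about {SECONDS / 3600:.1f} hours")
```

With these inputs a single pass takes a little over two hours, which is consistent with the abstract's framing of the exercise as a demanding but interactive-scale target.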
Differentiable Programming: Neural Networks and Selection Cuts Working Together

https://doi.org/10.1051/epjconf/202429509011

Watts, Gordon (May 2024, EPJ Web of Conferences)
De_Vita, R; Espinal, X; Laycock, P; Shadura, O (Ed.)
Differentiable Programming could open even more doors in HEP analysis and computing to Artificial Intelligence/Machine Learning. Current common uses of AI/ML in HEP are deep learning networks, which provide sophisticated ways of separating signal from background, classifying physics, etc. This is only one part of a full analysis: normally skims are made to reduce dataset sizes by applying selection cuts, further selection cuts are applied, perhaps new quantities are calculated, and all of that is fed to a deep learning network. Only the deep learning network stage is optimized using the AI/ML gradient descent technique. Differentiable programming offers us a way to optimize the full chain, including selection cuts that occur during skimming. This contribution investigates applying selection cuts in front of a simple neural network using differentiable programming techniques to optimize the complete chain on toy data. There are several well-known problems that must be solved, e.g., selection cuts are not differentiable, and the interaction of a selection cut and a network during training is not well understood. This investigation was motivated by trying to automate reduced dataset skims and sizes during analysis: HL-LHC analyses have potentially multi-TB dataset sizes, and an automated way of reducing those dataset sizes and understanding the trade-offs would help the analyser make a judgement between time, resource usage, and physics accuracy. This contribution explores the various techniques to apply a selection cut that are compatible with differentiable programming and how to work around issues when it is bolted onto a neural network. Code is available.
Full Text Available
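The abstract notes that a hard selection cut is not differentiable. One common workaround, shown here only as a minimal sketch and not necessarily among the techniques studied in the contribution, replaces the step function `x > threshold` with a sigmoid weight so that the threshold can be tuned by gradient descent. The toy data, the loss (a soft count of misclassified events), the steepness, and all function names are assumptions made for illustration.

```python
import math


def soft_cut(x, threshold, steepness=10.0):
    """Smooth stand-in for the hard cut x > threshold: a weight in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-steepness * (x - threshold)))


def loss_and_grad(threshold, signal, background, steepness=10.0):
    """Soft misclassification count and its derivative w.r.t. the threshold."""
    loss = 0.0
    grad = 0.0
    for x in background:  # background passing the cut is penalised
        w = soft_cut(x, threshold, steepness)
        loss += w
        grad += -steepness * w * (1.0 - w)
    for x in signal:  # signal failing the cut is penalised
        w = soft_cut(x, threshold, steepness)
        loss += 1.0 - w
        grad += steepness * w * (1.0 - w)
    return loss, grad


def fit_threshold(signal, background, start=0.1, lr=0.01, steps=500):
    """Plain gradient descent on the cut threshold."""
    threshold = start
    for _ in range(steps):
        _, grad = loss_and_grad(threshold, signal, background)
        threshold -= lr * grad
    return threshold


# Toy data (invented for this example): background low, signal high.
SIGNAL = [1.0, 1.2, 1.4, 1.6]
BACKGROUND = [0.0, 0.2, 0.4, 0.6]
FITTED = fit_threshold(SIGNAL, BACKGROUND)
print(f"fitted threshold: {FITTED:.3f}")
```

Because the toy samples are mirror images of each other about 0.8, the fitted threshold should settle close to that value. The steepness controls the trade-off the abstract alludes to: a steeper sigmoid approximates the hard cut better but yields vanishing gradients for events far from the threshold.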
How the Scientific Python ecosystem helps answer fundamental questions of the Universe

https://doi.org/10.25080/KMXN4784

Feickert, Matthew; Hartmann, Nikolai; Heinrich, Lukas; Held, Alexander; Kourlitis, Vangelis; Krumnack, Nils; Stark, Giordon; Vigl, Matthias; Watts, Gordon (July 2024, proceedings.scipy.org)

The ATLAS experiment at CERN explores vast amounts of physics data to answer the most fundamental questions of the Universe. The prevalence of Python in scientific computing motivated ATLAS to adopt it for its data analysis workflows while enhancing users’ experience. This paper will describe to a broad audience how a large scientific collaboration leverages the power of the Scientific Python ecosystem to tackle domain-specific challenges and advance our understanding of the Cosmos. Through a simplified example of the renowned Higgs boson discovery, attendees will gain insights into the utilization of Python libraries to discriminate a signal in immersive noise, through tasks such as data cleaning, feature engineering, statistical interpretation and visualization at scale.
Full Text Available
hep_tables: Heterogeneous Array Programming for HEP

https://doi.org/10.1051/epjconf/202125103061

Watts, Gordon (January 2021, EPJ Web of Conferences)
Biscarat, C.; Campana, S.; Hegner, B.; Roiser, S.; Rovelli, C.I.; Stewart, G.A. (Ed.)
Array operations are one of the most concise ways of expressing common filtering and simple aggregation operations that are the hallmark of a particle physics analysis: selection, filtering, basic vector operations, and filling histograms. The High Luminosity run of the Large Hadron Collider (HL-LHC), scheduled to start in 2026, will require physicists to regularly skim datasets that are over a PB in size, and repeatedly run over datasets that are hundreds of TB, too big to fit in memory. Declarative programming techniques are a way of separating the intent of the physicist from the mechanics of finding the data and using distributed computing to process and make histograms. This paper describes a library that implements a declarative distributed framework based on array programming. This prototype library provides a framework for different sub-systems to cooperate in producing plots via plug-ins. This prototype has a ServiceX data-delivery sub-system and an Awkward Array sub-system cooperating to generate requested data or plots. The ServiceX system runs against ATLAS xAOD data and flat ROOT TTrees, and Awkward Array runs on the columnar data produced by ServiceX.
Full Text Available
FuncADL: Functional Analysis Description Language

https://doi.org/10.1051/epjconf/202125103068

Proffitt, Mason; Watts, Gordon (January 2021, EPJ Web of Conferences)
Biscarat, C.; Campana, S.; Hegner, B.; Roiser, S.; Rovelli, C.I.; Stewart, G.A. (Ed.)
The traditional approach in HEP analysis software is to loop over every event and every object via the ROOT framework. This method follows an imperative paradigm, in which the code is tied to the storage format and steps of execution. A more desirable strategy would be to implement a declarative language, such that the storage medium and execution are not included in the abstraction model. This will become increasingly important to managing the large datasets collected by the LHC and the HL-LHC. A new analysis description language (ADL) inspired by functional programming, FuncADL, was developed using Python as a host language. The expressiveness of this language was tested by implementing example analysis tasks designed to benchmark the functionality of ADLs. Many simple selections are expressible in a declarative way with FuncADL, which can be used as an interface to retrieve filtered data. Some limitations were identified, but the design of the language allows for future extensions to add missing features. FuncADL is part of a suite of analysis software tools being developed by the Institute for Research and Innovation in Software for High Energy Physics (IRIS-HEP). These tools will be available to develop highly scalable physics analyses for the LHC.
Full Text Available
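The contrast this abstract draws between imperative loops and a declarative, functional description can be sketched in plain Python. The `Query` class, its method names, the `run_in_memory` backend, and the toy events below are all hypothetical and are not the FuncADL API; they only illustrate how a query can record what is wanted while a separate backend decides how and where to execute it.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Tuple

Event = Dict[str, Any]


def imperative_jet_pts(events: List[Event]) -> List[float]:
    """Imperative style: loop, storage layout, and selection are intertwined."""
    selected = []
    for event in events:
        for jet in event["jets"]:
            if jet["pt"] > 30.0:
                selected.append(jet["pt"])
    return selected


@dataclass(frozen=True)
class Query:
    """Hypothetical declarative query: it records steps and executes nothing."""
    steps: Tuple[Tuple[str, Callable[[Any], Any]], ...] = ()

    def _then(self, kind: str, fn: Callable[[Any], Any]) -> "Query":
        return Query(self.steps + ((kind, fn),))

    def flat_map(self, fn: Callable[[Any], Any]) -> "Query":
        return self._then("flat_map", fn)

    def where(self, fn: Callable[[Any], Any]) -> "Query":
        return self._then("where", fn)

    def select(self, fn: Callable[[Any], Any]) -> "Query":
        return self._then("select", fn)


def run_in_memory(query: Query, events: List[Event]) -> List[Any]:
    """One possible backend; another could translate the steps for remote data."""
    items: List[Any] = list(events)
    for kind, fn in query.steps:
        if kind == "flat_map":
            items = [inner for item in items for inner in fn(item)]
        elif kind == "where":
            items = [item for item in items if fn(item)]
        elif kind == "select":
            items = [fn(item) for item in items]
        else:
            raise ValueError(f"unknown step kind: {kind}")
    return items


# Toy events invented for this example.
EVENTS = [
    {"jets": [{"pt": 45.0}, {"pt": 12.5}]},
    {"jets": []},
    {"jets": [{"pt": 60.5}, {"pt": 31.0}]},
]

JET_PT_QUERY = (
    Query()
    .flat_map(lambda event: event["jets"])
    .where(lambda jet: jet["pt"] > 30.0)
    .select(lambda jet: jet["pt"])
)

print(run_in_memory(JET_PT_QUERY, EVENTS))
```

The query object holds no reference to the data or to the execution strategy, so the same description could in principle be handed to a different backend without changing the analysis code.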
Open is not enough

https://doi.org/10.1038/s41567-018-0342-2

Chen, Xiaoli; Dallmeier-Tiessen, Sünje; Dasler, Robin; Feger, Sebastian; Fokianos, Pamfilos; Gonzalez, Jose Benito; Hirvonsalo, Harri; Kousidis, Dinos; Lavasa, Artemis; Mele, Salvatore; et al (February 2019, Nature Physics)

Full Text Available
Searching for long-lived particles beyond the Standard Model at the Large Hadron Collider

https://doi.org/10.1088/1361-6471/ab4574

Alimena, Juliette; Beacham, James; Borsato, Martino; Cheng, Yangyang; Vidal, Xabier Cid; Cottin, Giovanna; Curtin, David; De Roeck, Albert; Desai, Nishita; Evans, Jared A; et al (September 2020, Journal of Physics G: Nuclear and Particle Physics)
Full Text Available
A Roadmap for HEP Software and Computing R&D for the 2020s

https://doi.org/10.1007/s41781-018-0018-8

Albrecht, Johannes; Alves, Antonio Augusto; Amadio, Guilherme; Andronico, Giuseppe; Anh-Ky, Nguyen; Aphecetche, Laurent; Apostolakis, John; Asai, Makoto; Atzori, Luca; Babik, Marian; et al (December 2019, Computing and Software for Big Science)

Full Text Available
Operation and performance of the ATLAS semiconductor tracker in LHC Run 2

https://doi.org/10.1088/1748-0221/17/01/P01013

Aad, Georges; Abbott, Brad; Abbott, Dale Charles; Abed Abud, Adam; Abeling, Kira; Abhayasinghe, Deshan Kavishka; Abidi, Syed Haider; Aboulhorma, Asmaa; Abramowicz, Halina; Abreu, Henso; et al (January 2022, Journal of Instrumentation)

The semiconductor tracker (SCT) is one of the tracking systems for charged particles in the ATLAS detector. It consists of 4088 silicon strip sensor modules. During Run 2 (2015–2018) the Large Hadron Collider delivered an integrated luminosity of 156 fb⁻¹ to the ATLAS experiment at a centre-of-mass proton-proton collision energy of 13 TeV. The instantaneous luminosity and pile-up conditions were far in excess of those assumed in the original design of the SCT detector. Due to improvements to the data acquisition system, the SCT operated stably throughout Run 2. It was available for 99.9% of the integrated luminosity and achieved a data-quality efficiency of 99.85%. Detailed studies have been made of the leakage current in SCT modules and the evolution of the full depletion voltage, which are used to study the impact of radiation damage to the modules.
Full Text Available